Creating a Workflow for Expressed Sequence Tags Analysis
Abstract
Expressed sequence tags (ESTs) are short sequence fragments of genes and are used in genomic and genetic investigations. Despite the rapid expansion of EST generation, the resulting sequences are relatively low-quality fragments that must be cleaned before they are assembled into larger sequences by identifying overlaps between sample sequences. Comparative analysis and functional assignment should then be performed to annotate and classify genes and to describe their functions. In this study, we report the establishment of a workflow for analyzing and assembling EST sequences into contigs and singlets, and the implementation of an EST database. High-quality assembled ESTs were annotated using BLASTX through our local BLAST server, searching several databases including the NCBI non-redundant protein database. The BLAST results were automatically extracted and transferred into a relational database. We used well-annotated Gene Ontology (GO) information to characterize gene function and to classify sequences by molecular function, biological process, and cellular component. Pathway mapping was based on the Kyoto Encyclopedia of Genes and Genomes (KEGG) classification. Enzyme Commission (EC) numbers were used to determine which sequences pertained to a specific pathway.
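The step of extracting BLAST results into a relational database could look roughly like the sketch below. It is not the authors' implementation: it assumes the BLASTX search was run with BLAST+ tabular output (`-outfmt 6`, default fields), uses SQLite as a stand-in for the unnamed relational database, and the table name `blast_hit`, the function names, and the e-value cutoff are all hypothetical choices made for illustration.

```python
import csv
import sqlite3

# Default field order of BLAST+ tabular output (-outfmt 6).
BLAST_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def load_blast_hits(conn, handle, max_evalue=1e-5):
    """Store tabular BLAST hits with evalue <= max_evalue.

    `blast_hit` is a hypothetical table; `max_evalue` is an illustrative
    cutoff, not a value taken from the abstract.
    Returns the number of rows inserted.
    """
    conn.execute(
        """CREATE TABLE IF NOT EXISTS blast_hit (
               query_id   TEXT,
               subject_id TEXT,
               pident     REAL,
               align_len  INTEGER,
               evalue     REAL,
               bitscore   REAL)"""
    )
    rows = []
    for rec in csv.reader(handle, delimiter="\t"):
        # Skip blank lines and comment lines (present in -outfmt 7 output).
        if not rec or rec[0].startswith("#"):
            continue
        hit = dict(zip(BLAST_COLUMNS, rec))
        if float(hit["evalue"]) <= max_evalue:
            rows.append((
                hit["qseqid"], hit["sseqid"], float(hit["pident"]),
                int(hit["length"]), float(hit["evalue"]),
                float(hit["bitscore"]),
            ))
    conn.executemany("INSERT INTO blast_hit VALUES (?, ?, ?, ?, ?, ?)", rows)
    conn.commit()
    return len(rows)


def best_hits(conn):
    """Return (query_id, subject_id, bitscore) for the top-scoring hit(s)
    of each contig or singlet, ordered by query_id."""
    return conn.execute(
        """SELECT h.query_id, h.subject_id, h.bitscore
           FROM blast_hit AS h
           WHERE h.bitscore = (SELECT MAX(bitscore)
                               FROM blast_hit
                               WHERE query_id = h.query_id)
           ORDER BY h.query_id"""
    ).fetchall()
```

With a connection from `sqlite3.connect("est.db")` and an open file handle on the BLASTX output, `load_blast_hits(conn, handle)` fills the table and `best_hits(conn)` returns one candidate annotation per assembled sequence (more than one if bit scores tie). Later steps such as GO or EC assignment could then join against this table by subject identifier.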
Similar resources
P-215: Discovery of A Novel APA Variant of A Human Potential Gene Based on Expressed Sequenced Tags Analysis
Background: Expressed sequence tags (ESTs) are sequences of cDNA fragments prepared from different tissue sources. There are over one million of these sequences in the publicly available database, and these sequences are believed to represent more than half of all human genes. The ESTs belong to different cDNA libraries, each prepared from one particular cell type, organ, or tumor. Therefore, th...
Determination of Genetic diversity of cultivated chickpea (Cicer arietinum L.) using Medicago truncatula EST-SSRs
Expressed sequence tag simple sequence repeats (EST-SSRs) are important sources for the investigation of genetic diversity and for molecular marker development. Like genomic SSRs, EST-SSRs are useful markers for many applications in genetics and plant breeding, such as genetic diversity analysis, molecular mapping, and cross-transferability across related species and genera. In spite of low po...
Reliable In Silico Identification of Sequence Polymorphisms and Their Application for Extending the Genetic Map of Sugar Beet (Beta vulgaris)
Molecular markers are a highly valuable tool for creating genetic maps. As in many other crops, sugar beet (Beta vulgaris L.) breeding is increasingly supported by the application of such genetic markers. Single nucleotide polymorphism (SNP) based markers have a high potential for automated analysis and high-throughput genotyping. We developed a bioinformatics workflow that uses Sanger and 2n...
Computational Identification of Micro RNAs and Their Transcript Target(s) in Field Mustard (Brassica rapa L.)
Background: Micro RNAs (miRNAs) are a pivotal part of non-protein-coding endogenous small RNA molecules that regulate the genes involved in plant growth and development, and respond to biotic and abiotic environmental stresses posttranscriptionally. Objective: In the present study, we report the results of a systemic search for identification of new miRNAs in B. rapa using homology-based ...
RED: the analysis, management and dissemination of expressed sequence tags
The Rancourt EST Database (RED) is a web-based system for the analysis, management, and dissemination of expressed sequence tags (ESTs). RED represents a flexible template DNA sequence database that can be easily manipulated to suit the needs of other laboratories undertaking mid-size sequencing projects.